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(54) Method and device for controlling data processing 



(57) The device according to the invention deter- 
mines conditions for a processing liable to be carried 
out on data in a file, by at least one input/output means. 

To this end, it has: 

a means of determining the semantics of the 
processing of the said data, and 



a configuration determination means adapted, with- 
out modifying the said data, to take into account the 
semantics of the said processing of the said data in 
order to determine the configuration of the pilot of 
the input/output means intended to implement this 
processing. 
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Description 

[0001] The present invention concerns a method and 
device for controlling the processing of an electronic 
document. It should be stated here that an electronic 
document consists of one or more electronic files. 
[0002] In the computer systems known at the present 
time, the peripherals generally have parameterizable 
operating modes whose parameters are fixed 

either at the time of manufacture of the peripheral, 
or, by default, by the central unit of the computer 
system, 

or by a dialogue between the user and this compu- 
ter system. 

[0003] The operating mode used does not then 
directly take account of the purpose of the processing, 
the nature of the data to be processed or the constraints 
in processing these data by this peripheral. 
[0004] In particular, in the case of a peripheral consist- 
ing of a printer, the constraints imposed by the printing 
of a document including images, for example the con- 
straints of the duration of printing, the restoration of 
shades or the quantity of information to be processed, 
are not directly taken into account by the computer sys- 
tems in order to fix the printing parameters. 
[0005] Certain file transfer protocols automatically 
take into account the form, binary or otherwise, of a file 
to be transferred in order to configure a file transmission 
means, without taking into account their semantics. 
[0006] Certain software packages, such as the "POW- 
ERPOINT" presentation software from the "MS 
OFFICE" software suite produced by the American 
company MICROSOFT, modify a file corresponding to a 
document currently being processed, according to mod- 
els or document media choices. This has the major 
drawback that the initial file is modified during the proce- 
dure. 

[0007] Certain operating systems, such as the one 
known by the name "WINDOWS 95", produced by the 
American company MICROSOFT, take into account 
user profiles referenced at least by a user name and/or 
a password, which gives rise to a default configuration 
of the system for a given user. In particular, this defines 
a default printer. A printer is then chosen without taking 
account of the origin or destination of documents which 
will be printed. 

[0008] The present invention sets out to remedy these 
drawbacks. 

[0009] For this purpose, the present invention relates, 
according to a first of its aspects, to a device for deter- 
mining conditions for processing liable to be carried out 
on data in a file, by at least one input/output means 
which modulates a physical quantity, characterised in 
that it has: 

a means of determining the semantics of the 



processing of the said data, and 
a configuration determination means adapted, with- 
out modifying said data, to take into account the 
semantics of said processing of said data in order 
5 to determine the configuration of the pilot of the 

input/output means designed to implement this 
processing. 

[001 0] Correlative! y, the present invention relates to a 
w method for determining conditions for processing liable 
to be carried out on data in a file, by at least one 
input/output means which modulates a physical quan- 
tity, characterised in that it has: 

15 - a step of determining the semantics of the process- 
ing of the said data, and 

a configuration determination step during which, 
without modifying said data, the semantics of said 
processing of said data taken into account in order 
20 to determine the configuration of the pilot of the 
input/output means designed to implement this 
processing. 

[0011] It should be noted here that the semantics of 
25 the processing of the file corresponds to the perception 
of the result of this processing. It therefore concerns: 

both the actual content of the file (a file principally 
consisting of an image, text, graphics, contrast, col- 
30 our, etc) 

and the destination of the file (display, copy, print- 
ing, etc) 

or the origin of the file (the software which gener- 
ates the data and the file, the author or the user of 
35 the file, etc). 

[001 2] The same file can therefore have successively 
different semantics and can give rise, according to the 
present invention, to different configurations of 
40 input/output means. 

[0013] Using different files having the same destina- 
tion can correspond to different semantics. The same 
applies to the same document used by different users. 
[0014] The actual content is therefore different from 
45 the apparent content obtained by analysing solely the 
header or the extension of the file name. 
[001 5] It should be noted here that the pilot configura- 
tion concerns both: 

so - a pilot of an input/output means, which is config- 
ured, and 

several pilots of such means, which are chosen and 
possibly which are configured. 

55 [001 6] It should be noted here that, in the state of the 
art known at the date of the present invention, a pilot is 
a software module which is situated in a computer sys- 
tem having a processor, a module dedicated to commu- 
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nication with another system having its operating 
software for effecting a set of elementary operations. 
[0017] The pilot translates a complex operation 
required by a high-level software application into a set of 
elementary operations which can be executed by the 
peripheral. 

[001 8] The configuration of a pilot amounts to the fix- 
ing of the values of each parameter of said translation. 
These parameters concern for example the translation 
of the font into a series of points able to be formed by 
the printer, or of the colour palette of the document into 
a colour palette of the printer. 

[0019] Thus, when the invention is, for example, 
applied to a document to be printed, in order to choose 
the correct printer configuration: 

the document is analysed in order to determine 
semantic characteristics thereof, 
a correct printer configuration is deduced there- 
from, and 

the printer is configured with the configuration data. 

[0020] It should be noted here that the printer config- 
uration concerns all the operating parameters, for both 
the choice of a paper tray and the functioning of the print 
head. 

[0021] By virtue of these provisions, the invention 
makes it possible: 

to improve the print chain when a document is 
printed on a printer; 

to choose the correct printer automatically; 

to choose the correct configuration of this printer 

automatically. 

[0022] According to particular characteristics: 

the processing is able to be carried out by at least 
two input/output means, and 
the pilot configuration used by the determination 
means includes the selection of the inpuVoutput 
means intended to implement the said processing. 

[0023] By virtue of these provisions, the device 
according to the invention allows the automatic selec- 
tion of the input/output means which is intended to 
implement the processing. 
[0024] According to particular characteristics: 

the device as briefly disclosed above has means of 
dialoguing with the user adapted to transmit ques- 
tions to the user and to receive information from 
him in response, and 

the configuration determination means is also 
adapted to take into account the information 
received, in response, from the user in order to 
determine the pilot configuration. 



[0025] By virtue of these provisions, supplementary 
information concerning the document can be captured 
by the user in order to remove any ambiguities or con- 
flicts concerning the processing of the data. 
s [0026] According to particular characteristics, the 
device as briefly disclosed above has a memory 
adapted to store information received, in response, from 
the user. 

[0027] By virtue of these provisions, the information 
w stored in memory make it possible, when a question 
which has already been posed to a user must be posed 
to him again: 
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either to avoid posing this question again, 

or to remind the user of a response which he has 

already made to this question, 

or to propose a response. 

[0028] According to particular characteristics, said 
memory is adapted also to store, associated with each 
item of information received: 

an item of information representing the user who 
supplied it; and/or 

an item of information representing the document 
concerned. 



[0029] By virtue of these provisions, the risk of error is 
limited when the response given the first time to a ques- 
30 tion is used under the same circumstances. 

[0030] According to particular characteristics, the 
device as briefly disclosed above has a pilot updating 
means adapted: 
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on the one hand to detect that a pilot of the 
input/output means intended to implement the said 
processing is not available or is not up to date in a 
memory, and 

on the other hand, to read the said pilot in another 
memory. 



[0031] By virtue of these provisions, when the pilot of 
the input/output means to be used is not available or is 
not up to date, for example in a memory of a computer, 
45 it is transmitted, for example remotely, from a database 
constituting said other memory. 
[0032] According to particular characteristics: 
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the device as briefly disclosed above has a means 
of checking the availability of input/output means 
adapted to transmit an item of information repre- 
senting unavailability for the processing of said 
data, when the means intended to process said 
data is not available for this purpose; and 
the determination means is adapted to take into 
account the unavailability information in order to 
determine the configuration of another input/output 
means able to implement said processing of said 
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data. 

[0033] By virtue of these provisions, when an 
input/output means is not available, another input/out- 
put means is selected and configured in order to imple- 
ment said processing of said data. 
[0034] The present invention relates, according to 
another of its aspects, to a method of determining the 
conditions of a processing liable to be carried out on 
data of a document, by at least one input/output means 
which uses a physical quantity, characterised in that it 
includes: 

an operation of determining at least two quantities 
related to the said document, 
an operation of estimating the content of this docu- 
ment, taking into account each said quantity, and 
a configuration determination operation during 
which the content of the document is taken into 
account in order to determine the configuration of 
the pilot of the input/output means intended to 
implement this processing. 

[0035] It should be stated here that: 

the content of a file corresponds to a format of the 
data which it contains. By way of example, this con- 
tent may include text, graphics (or drawings) and/or 
images; 

the pilot configuration concerns both: 

a pilot of an input/output means, which is con- 
figured, and 

several pilots of such means, which are chosen 
and possibly 

which are configured; and 

the approximate methods based on estimations 
form part of the methods referred to as heuristic in 
the literature and make reference to decision crite- 
ria, methods or principles used for choosing which 
of various alternatives is the most effective for 
achieving an aim. A heuristic is a compromise 
between on the one hand the need to use simple 
and/or rapid criteria and on the other hand to dis- 
criminate the best choice correctly. 

[0036] Thus the result of the choice can prove to be ill- 
suited to the processing to be carried out, unlike exact 
determination methods, which determine a content in a 
certain manner. 

[0037] The reader can refer to the book by Judea 
Pearl, "Heuristics, intelligent search strategies for com- 
puter problem solving", published by Addison -Wesley 
Publishing Company, for more information on the con- 
cepts of heuristics and approximate determination. 
[0038] It should be noted here that, in the state of the 
art known at the date of the present invention, a pilot is 



a software module which is found in a computer system 
including a processor, a module dedicated to communi- 
cation with another system having its operating soft- 
ware for performing a set of elementary operations, 
s [0039] The pilot translates a complex operation 
required by a high-level software application into a set of 
elementary operations which can be executed by the 
peripheral. 

[0040] Configuring a pilot amounts to fixing the value 
io oi each parameter of the said translation. These param- 
eters concern, for example, the translation of the char- 
acter font into a series of dots able to be formed by the 
printer, or of the document colour palette into a colour 
palette of the printer. 
15 [0041] Thus, when the invention is, for example, 
applied to a document to be printed, in order to choose 
the correct printer configuration: 

the document is analysed in order to determine 
20 quantities which correspond to it, such as the dura- 
tion of opening of the file, the number of pages 
which it represents, the degree of compression 
which rt is able to have with a given file compression 
algorithm, the number of numeric information items 
25 which it contains, 

an estimation of the content of the file is then 
deduced therefrom, text, graphics and/or images, 
at least one printer suited to this content is deter- 
mined, which is configured for printing all or part of 
30 this document, and 

the printer is configured with the configuration data. 

[0042] It should be stated here that the printer config- 
uration concerns all the operating parameters of the 
35 printer. 

[0043] In the case of the printing of a document, the 
invention makes it possible: 

to improve the print chain, and 
40 - automatically to choose the correct configuration of 
this printer. 

[0044] According to particular characteristics, the 
quantity determination operation includes an operation 
45 of scrutinising the data of the said document which are 
directly accessible to a user. 

[0045] By virtue of these provisions, the quantities 
considered represent the data which are directly acces- 
sible to the user. They are therefore more precise for 

so determining the content of the document. 

[0046] According to particular characteristics, the esti- 
mation operation is able to be carried out without having 
to use a software application able to make it possible to 
modify the content of the said document. 

55 [0047] By virtue of these provisions, the estimation 
time does not include any time for opening the docu- 
ment with a complex software application, it is therefore 
fast. 
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[0048] According to particular characteristics, the esti- 
mation operation includes an operation of reading at 
least one physical quantity in a file. 
[0049] By virtue of these provisions, obtaining this 
physical quantity* is particularly easy and fast 5 
[0050] According to particular characteristics, 

the processing is able to be carried out by at least 
two input/output means, and 

the pilot configuration determination operation 10 
includes an operation of selecting the input/output 
means intended to implement the said processing. 

[0051] By virtue of these provisions, the device 
according to the invention allows the automatic selec- 15 
tion of the input/output means which is intended to 
implement the processing and, in the case of printing, 
the printer which is most suited to all or part of the doc- 
ument under consideration. 

[0052] According to particular characteristics: 20 

one of the said quantities represents the number of 
pages of the document represented by the said 
document, and/or 

one of the said quantities represents the number of 25 
numeric information items in the said document. 

[0053] By virtue of these provisions: 

a ratio of the other quantity to that which represents 30 
the size of the document, by number of pages or by 
number of information items, can be produced, and 
thus the determination of the content of the docu- 
ment can be made independent of the number of 
pages of this document, and/or 35 
a ratio of the number of numeric information items 
necessary for the description of a page of the doc- 
ument can be calculated. This is because, in gen- 
eral, documents consisting exclusively of images 
(for example photographs) use a larger number of *o 
numeric data per page than those consisting of 
drawings, and the latter use more than the docu- 
ments consisting of text. This ratio therefore makes 
it possible to estimate the content of the document. 

45 

[0054] According to particular characteristics, one of 
the said quantities represents a time taken to open the 
said document by software able to modify the content 
thereof. 

[0055] By virtue of these provisions, the documents so 
which have been created by using this software or 
which can be modified by the use of this software, can 
be discriminated from the other documents. This is 
because, for equal numbers of numeric information 
items, the time of opening of the first is. at the time of the ss 
present invention, generally less than the time of open- 
ing of the second ones. 

[0056] According to particular characteristics: 



the ratio of the said quantities represents a degree 
of compression able to be obtained on the said doc- 
ument by using predetermined compression soft- 
ware, and/or 

one of the said quantities represents a time neces- 
sary for the compression of the said document by 
predetermined compression software. 

[0057] By virtue of these provisions, the content of the 
document can be determined since, in general, the best 
degrees of compression and the shortest compression 
times are obtained with documents whose content is 
formed by images and then with documents whose con- 
tent is formed by drawings and then with documents 
whose content is formed by text. 

[0058] According to particular characteristics, the 
method as briefly disclosed above includes an opera- 
tion of determining parts of the document and, for each 
of the said parts: 

an operation of determining at least two quantities 
related to the said part, 

an operation of estimating the content of the said 
part, taking into account each quantity related to 
the said part, and 

a configuration determination operation during 
which the content of the said part is taken into 
account in order to determine the configuration of 
the pilot of the input/output means intended to 
implement this processing on the said part. 

[0059] By virtue of these provisions, for the processing 
of each part, for example of each page, the choice of the 
input/output means and/or the configuration of the 
input/output pilot can be optimised for each part of the 
document to be processed. 

[0060] According to another aspect, the invention 
relates to a device for determining the conditions of a 
processing liable to be carried out on data of a docu- 
ment, by at least one input/output means which uses a 
physical quantity, characterised in that it has: 

a means of determining at least two quantities 

related to the said document, 

a means of estimating the content of this document, 

adapted to take into account each said quantity, 

and 

a configuration determination means adapted to 
take into account the content of the document in 
order to determine the configuration of the pilot of 
the input/output means intended to implement this 
processing. 

[0061 ] The invention also relates to a printer, a facsim- 
ile machine, facsimile card, a modem, a display screen, 
a photographic apparatus and a camera including a 
sensor, characterised in that they include a device as 
briefly disclosed above. 



5 



9 



EP 0 926 587 A1 



10 



[0062] According to another aspect, the invention also 
relates to: 

a computer program supported by an information 
medium, such as a diskette or a compact disc, 
characterised in that it implements the method of 
the invention as brief ly disclosed above; and 
a memory, such as a diskette or a compact disc 
intended to be read by a data processing system, 
characterised in that it supports program instruc- 
tions whose execution by said processing system 
implements the method of the invention as briefly 
disclosed above. 

[0063] The invention will be understood more clearly 
from a reading of the description which follows, given 
with regard to the accompanying drawings, in which: 

Figure 1 depicts a flow diagram of the print chain for 

a document according to the present invention; 

Figure 2 details part of Figure 1 ; 

Figure 3 details part of Figure 1 ; 

Figure 4 details part of Figure 1 ; 

Figure 5 depicts, in more detail, part of Figure 3; 

Figure 6 details part of Figure 1 ; 

Figure 7 details an operation in Figures 1 and 6; 

Figure 8 depicts a device adapted to implement the 

present invention; 

Figure 9 depicts a device adapted to implement the 
present invention (two printers, one a laser printer 
and one an ink jet printer, with two operating 
modes, monochrome and colour), 
Figures 1 0A and 10B depict a flow diagram for the 
operation of the device illustrated in Figure 9, and 
Figures 11A and 11 B depict flow diagrams for 
determining thresholds liable to be implemented in 
the flow diagram illustrated in Figure 10. 

[0064] In the first embodiment described and depicted 
with regards to figs 1 to 8, the file processing consists of 
an application of printing data represented by this file. 
The invention applies, however, also to any processing 
of data capture, data transmission, data display, data 
reading or data storage. 

[0065] In the remainder of the description, the term 
"knowledge" is employed to define an item of informa- 
tion which concerns several data items and which 
relates to their source, their destination or their process- 
ing, modification of an item of knowledge does not mod- 
ify the meaning of the data but relates to their 
presentation, that is to say to the manner in which they 
are made available, transmitted, captured, read or 
stored. 

[0066] In the example described and depicted, the 
items of knowledge, also referred to as "characteristics", 
related to a document which a user wishes to print, are, 
in particular: 



the user identifier, 

the choice between a printing of the "rough draft" 
type and a printing of the "final document" type, 
the choice between a transparent print medium and 
s a paper print medium, 

the colour of the medium, 

the fact that the proximity of the printer is required, 
the fact that the printing time is crucial 
the choice of the print format, 

10 - the required reduction ratio: 

the choice between printing on both sides and print- 
ing on the front face only, 
the proportion of text in the document, 
the proportion of image in the document, 

15 - the proportion of graphics in the document, 
the size of the document, 

the printing resolution necessary for the document. 

[0067] Figure 1 depicts operations which are imple- 
20 mented as soon as a user 103 requests a processing 
(here a printing) of a document 104 consisting of a file 
stored in a random access memory of the device of the 
invention, and coming from software of a known type, 
for example a spreadsheet, a drawing program or word- 
25 processor software: 

an operation 105 of interrogating the user (also 
referred to as a "user" interface) enabling the user 
103 to communicate with the device of the invention 

30 (a function partially detailed in Figure 2), 

a document analysis operation 1 06 which works on 
the document 104, detailed in Figures 3 and 5, 
a configuration determination operation 107, 
detailed in Figure 4 and acting on the one hand on 

35 the information given by the user 103 by known 
input means and on the other hand on the results of 
the document analysis operation 106 in order to 
determine on the one hand a printer adapted to the 
printing of the document and on the other hand the 

40 configuration of this printer, 

a printer pilot availability test 108 performed after 
the configuration determination operation, 

and, when the printer pilot considered is not 
45 available: 

a pilot updating operation 101 , detailed in Figure 7, 
a pilot installation operation 102, of a known type 
and, when the test 108 determines that the printer 
so pilot is available and the operation 102 has been 
performed: 

a pilot configuration operation 109, detailed in Fig- 
ure 6, and 

a printing operation 1 10. of a known type, enabling 
55 a printer 1 1 1 to supply a printed document 112. 

[0068] The user 103 uses known means (keyboard, 
mouse, touch screen, microphone, etc) for replying to 
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the questions posed by a computer (see Figure 8) which 
implements the present invention in the context of a 
user interrogation operation 105 or the configuration 
determination operation 107. 

[0069] A document database 808 (Figure 8) is struc- 
tured in a known fashion. It stores, for each document 
104 already printed, a quadruplet consisting of: 

the identifier of the document, 

a list 309 of characteristics or knowledge of this 

document, 

the printer corresponding to the last printing, and 
the printer pilot configuration corresponding to the 
last printing. 

[0070] The list of characteristics 309 includes the 
knowledge resulting from the analysis operation 106 
and the knowledge resulting from the dialogue 1 05 with 
the user 103, this knowledge being associated with an 
item of information identifying the user. 
[0071 ] The pilot availability test 1 08 consists of seek- 
ing, in the random access memory of the computer 
which implements the method of the present invention, 
whether the pilot of the printer selected, during the oper- 
ation 107, is available or not. 

[0072] The pilot configuration operation 109 consists 
of copying, in registers reserved for this purpose, values 
of information which represent operating parameters of 
a printer intended to print the document 104. 
[0073] The print operation 1 10 consists of sending to 
the printer 111, when it is configured, the data of the file 
representing the document to be printed 104. 
[0074] As a variant, the print operation 110 includes a 
printer availability test and, when the printer is not avail- 
able, the choice of a printer whose technical capabilities 
are such that, after the configuration of this new printer, 
the characteristics of this printed document will be close 
to the result expected by the user. 
[0075] To this end, the print operation 1 10 includes: 

a step of checking the availability of the input/output 
means selected, during which the availability of the 
input/output means (that is to say here of the 
printer) is checked and there is received, from this 
means, an item of information representing unavail- 
ability for the processing of said data, when it is not 
available for this purpose; 

the determination means takes the unavailability 
information into account in order to determine the 
configuration of another input/output means able to 
implement said processing of said data. 

[0076] In the embodiment described and depicted, in 
order to effect the choice of the other printer able to print 
the document 104. when the printer selected during 
operation 107 is not available, the print operation 110 
include the same steps as the operation 107 (see Fig- 
ures 2 and 4), knowing that another document corre- 



sponding to the unavailable printer is not considered in 
these steps, and next returns to operation 108. 
[0077] When the printer initially selected is not availa- 
ble, it is therefore, iteratively, the printer which, amongst 

5 those which remain, is capable of supplying the docu- 
ment which corresponds most closely to the wishes of 
the user, which is chosen, configured and used. 
[0078] Figure 2, which details part of Figure 1 , shows, 
for each question 201 posed to the user, a test 202 

io which determines whether a response has already 
been given to said question. 

[0079] It should be noted here that the questions 
posed to the user, during operation 105, concern the 
semantics of the processing of the document 104: 

15 

"rough draft or final document?" 
"printing medium transparent or paper?" 
"colour of medium?" 
"proximity of printer required?" 
20 - "printing time crucial?" 
"print format?" 
"reduction ratio required?" 
"printing on both sides or solely on front face ?" 

25 [0080] In addition, the identifier of the user 1 03 is auto- 
matically supplied at the time of starting up the device of 
the invention. 
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by recognition of the absence of a password, 

by the presence of a password which corresponds 

to a user, or 

by recognition of a memory card or any other 
known identification means, or 
through the response to an additional question con- 
cerning its identity. 



[0081 ] During the test 202, the processor of the com- 
puter 1 00 (Figure 8) gains access, in the document data 
base 808, to a memory register which corresponds on 

40 the one hand to the user and on the other hand to the 
question under consideration. When this register is 
empty, the result of test 202 is negative and operation 
204 is performed. When the register stores data repre- 
senting a possible response to said question, the result 

45 of test 202 is positive and test 203 is performed. 

[0082] Test 203 consists of transmitting to the user a 
question 'There already exists a stored response to this 
question. Should it be used?", stating the question con- 
sidered and the stored response. 

so [0083] When the user supplies a positive response, 
operation 205 is performed. When the user supplies a 
negative response, operation 204, which consists of dis- 
playing the question under consideration on the screen 
of the computer 100, is performed. Operation 205 con- 

55 sists of receiving the response made by the user, either 
following operation 204, the response from the user 
then being effected by using an input interface, or fol- 
lowing the positive result of test 203, the response from 
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the user then being read in the register disclosed above. 
Operation 207 then consists of storing, in said register, 
this response in the file corresponding to the question 
posed. 

[0084] Operation 208 next consists of displaying the 5 
response on the screen of the computer 100. 
[0085] It will be understood that the use of the user 
interface illustrated in Figure 2 makes it possible to 
reduce the duration of capture of the responses, since 
the questions has already been posed in the same con- io 
text and the user desires to make the same response to 
it. 

[0086] As a variant, a single global question can be 
used in place of all the tests 203 which correspond to 
the successive questions: "use the response or is 
responses already known". The single response to this 
question then replaces each of the responses to tests 
203. 

[0087] Figure 3 depicts in more detail the document 
analysis step 106 (Figure 1). 20 
[0088] A document 1 04 being considered, a document 
origin determination operation 301 is performed (see 
Figure 5) in order to determine with which software the 
document 104 was generated. 

[0089] Next, an operation 302 consists of determining 2s 
a proportion of text in the file 104 under consideration. 
[0090] Next, an operation 303 consists of determining 
a proportion of image in the file 104 under considera- 
tion. 

[0091] Next, an operation 304 consists of determining 30 
a proportion of graphics in the file 104 under considera- 
tion. 

[0092] These operations 302 to 304 are performed by 
analysing the file (a so-called GDI file) which in fact con- 
tains references to the nature of the objects which it 35 
incorporates. 

[0093] In fact, in a known fashion, a page consists of 
a certain number of boxes and each box corresponds to 
an object of a certain type or coming from a certain 
application. For example, a given page can include 40 
simultaneously: 

"image" zones whose content is described, in the 
file, as an image, 

"text" zones, whose content is described, in the file, 45 
as text, and 

"graphics** zones whose content is described, in the 
file, as a graphic. 

[0094] Knowing the dimensions of each box, or zone, so 
and their type ("image", "text" or "graphic"), the proces- 
sor 100 (Figure 8) determines the proportions of these 
three types of zone in the document 104 to be printed. 
[0095] Next, an operation 305 determines document 
size information, in the form of a pair representing on ss 
the one hand the number of pages in the document 1 04 
and on the other hand the number of bytes in the docu- 
ment 104. 



[0096] Then an operation 306 consists of determining 
the necessary resolution by reading this information, 
whose unit is the number of points per unit length, in the 
file corresponding to document 104. 
[0097] Next, an operation 307 consists of determining 
the print medium, by reading this information in the file 
of the document 104. It should be noted here that this 
information is available only in documents generated by 
certain particular software packages. By default, that is 
to say where said information is not available in the file 
104, the paper medium is selected. 
[0098] Then the operation 308 consists of storing in 
the memory of the processor 100 (Figure 8), in a list of 
characteristics 309 associated with the document 104 
under consideration, in the document database 808 
containing the knowledge obtained during the opera- 
tions 301 to 307. 

[0099] Figure 4 depicts, in more detail, the knowledge 
processing operation 107. 

[01 00] A document database 808 including the quad- 
ruplets consisting of: 

identifiers of documents already printed, 
characteristics of these documents, 
the printer corresponding to the last printing, and 
the printer pilot configuration corresponding to the 
last printing, it being considered, the following oper- 
ations are successively performed, considering a 
document 104 and its own characteristics stored, in 
a list 309. 

[0101] During an operation 401, all the documents 
which have the same proportion of texts as the propor- 
tion of text stored in the list of characteristics 309, are 
selected. To this end, the proportions of texts are suc- 
cessively compared and, when their difference is less, 
in percentage terms, than a first predetermined thresh- 
old value, the document is selected. By way of example, 
a first threshold value of 1 5% can be used. A list of doc- 
uments 402 having the same proportion of text is thus 
constituted. 

[0102] Next, during an operation 403, all the docu- 
ment which have the same proportion of image as the 
proportion of image stored in the list of characteristics 
309 are selected. To this end, the image proportions are 
successively compared and when their difference is 
less, in percentage terms, than a second predetermined 
threshold value (for example 15%), the document is 
selected. A list of documents 404 having the same pro- 
portion of image is thus constituted. 
[0103] Then, during an operation 430, all the docu- 
ments which have the same proportion of graphics as 
the proportion of graphics stored in the list of character- 
istics 309 are selected. To this end, the proportions of 
graphics are successively compared and when their dif- 
ference is less, in percentage terms, than a third prede- 
termined threshold value (for example 15%), the 
document is selected. A list of documents 431 having 
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the same proportion of graphics is thus constituted. 
[0104] Then, during an operation 405, all the docu- 
ments which have the same colour palette as that 
stored in the list of characteristics 309 are selected. To 
this end, the colour palettes are successively compared 
and when they are identical the document is selected. A 
list of documents 406 having the same colour palette is 
thus constituted. 

[0105] Next, during an operation 407, all the docu- 
ment which have the same size as the size stored in the 
list of characteristics 309 are selected. To this end, the 
sizes are successively compared and when their differ- 
ence is less, in percentage terms, then a fourth prede- 
termined threshold value (for example 15%) the 
document is selected. A list of documents 408 having 
the same size is thus constituted. 
[0106] Then, during an operation 409, all the docu- 
ment which have the same resolution as the resolution 
stored in the list of characteristics 309 are selected. To 
this end, the resolutions are successively compared 
and when their difference is less, in percentage terms, 
than a fifth predetermined threshold value (for example 
15%) the document is selected. A list of documents 410 
having the same text resolution is thus constituted. 
[0107] Then, during an operation 411, all the docu- 
ments which have the same medium as the one stored 
in the list of characteristics 309 are selected. To this 
end. the information media are successively compared 
and, when they are identical, the document is selected. 
A list of documents 412 having the same print medium 
is thus constituted. 

[0108] For the purpose of clarity of the description, the 
determination of the lists of documents concerning the 
other characteristics, in particular those resulting from 
the dialogue 105, are not depicted in Figure 4. Each of 
them is effected in the same way as the list 412, select- 
ing all the documents which have, for said characteristic 
under consideration, the same value as that stored in 
the list of characteristics 309, comparing them succes- 
sively. Lists Of documents 440, 441 , 442, 443, 444, 445, 
446, and 447 are thus constituted, corresponding 
respectively to the responses to the question posed dur- 
ing operation 105: 

"rough draft or final document?" 
"colour of medium?" 
"proximity of printer required?" 
"printing time crucial?" 
"print format?" 
"reduction ratio required?" 
"printing on both sides or soiely on front face ?" 
and to the identification of the user. 

[0109] Next, during a test 413. the processor 100 (Fig- 
ure 8) determines whether the documents which belong 
to the largest number of lists (402, 404, 431. 406, 408, 
410. 412 and 440 to 447) all correspond to the same 
processing characteristics (choice of input/output inter- 



face and/or choice of processing pilot configuration). 
[01 1 0] To this end, starting from 1 5 (since there are fif- 
teen characteristics considered), the number of charac- 
teristics considered is progressively decremented and 

5 documents are sought which are situated in said 
number of lists (402, 404, 431 . 406. 408, 41 0 or 41 2 and 
440 to 447) until there is at least one of them. 
[01 11] When all the documents, here referenced 41 5, 
which belong to the largest number of lists, correspond 

w to the same choice of printer and printer configuration, 
this choice of printer and this configuration are repeated 
for printing the document under consideration, during 
operation 421. 

[01 12] When the result of test 413 is negative, that is 
15 to say when no document belongs to a list or when at 
least two documents which belong to the largest 
number of lists correspond either to different printers or 
to different printer configurations, an operation 414 con- 
sists of making a list, referenced 414, of the documents 
20 which belong to the largest number of lists, that is to say 
documents which respond to the largest number of the 
fifteen tests 401 . 403, 430, 405, 407, 409 or 41 1 and the 
corresponding tests, not shown. 

[0113] During a test 422, the processor 100 controls 

25 the display of a question on the screen, in order to 
determine whether or not the functioning of the device 
should continue in interactive mode. In the affirmative, 
that is to say when the user supplies a positive 
response, in order to determine, amongst the docu- 

30 ments in list 414, whether one can supply the process- 
ing characteristics (choice of printer and/or printer 
configuration), the user is requested to put in a hierar- 
chy the characteristics to which the lists in which not all 
the documents in list 414 are found relate. 

35 [0114] Thus, in order to resolve these conflicts, the 
device according to the invention asks the user the fol- 
lowing question, in which are withdrawn the characteris- 
tics which correspond to the lists in which there are 
either all the documents in list 414, or none of them: 7s 

40 it more important to favour word processing, graphics, 
images, the number of colours, the contrast of the col- 
ours, the speed of printing, the resolution of the print- 
ing?". 

[01 1 5] For example, if a list 41 4 contains only two doc- 
45 uments and if only the lists 402, 406 and 41 0 have only 
one of these two documents, the question posed is 
[01 1 6] 7s it more important to favour the processing of 
images, the number of colours, the contrast of the col- 
ours, the speed of printing and/or the resolution of the 
so printing, the proximity of the printer the printing time, the 
printing format the reduction ratio required, or printing 
on both sides?". 

[01 17] Next, preserving only the documents which are 
in the lists relating to a characteristic considered by the 
55 user as of prime importance, at least one document is 
selected during an operation 417. 
10118] Then, during a test 418, it is determined 
whether or not the conflicts are resolved, that is to say if 
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there remains more than one document kept. 
[0119] In the affirmative, operation 415 is performed. 
In the negative, a test 419 determined whether the user 
can refine the order, that is to say if it is possible to sort 
the documents still further or if they all form part of the s 
same lists. 

[0120] When the result of test 419 is positive, opera- 
tion 41 6 is reiterated. When the result of test 41 9 is neg- 
ative, during an operation 420, the document 104 to be 
printed is treated as a new document, that is to say the 10 
user is requested to choose the document processing 
characteristics, namely, here: 

the printer which is intended to print it, on the one 
hand, and is 
the configuration of this printer on the other hand. 

[01 21 ] Thus, during operation 416, the characteristics 
which are selected by the user allow a new evaluation of 
the documents in list 414, which are in conflict, and 20 
next, where applicable, the choice of one of these docu- 
ments whose processing characteristics (choice of 
input/output interface and/or choice of processing pilot 
configuration) will be attributed to document 104 in the 
course of processing. 25 
[0122] Following operation 420, the knowledge con- 
cerning this document is associated with the document 
represented by the list of characteristics 309, in the doc- 
ument database, during the operation 421 . 
[0123] It will be observed that, when the result of test 30 
422 is negative, operation 421 is directly performed, 
with the processing parameters fixed, by default, during 
the configuration of the system or software. 
[0124] In Figure 5, operation 301 (Figure 3) is 
detailed. A document 104 being considered, test 501 35 
consists of determining whether this document results: 

from the software suite sold under the trade name 
"MICROSOFT", known as "MS OFFICE" and 
including notably the software "Word", "Powerpoint" 40 
and "Excel", box 503, 

from the software suite sold by the American com- 
pany "ADOBE" containing notably the software 
"Framemaker", "Photoshop" and "Acrobat", box 
502, or 45 
from another software, box 504. 

[0125] To this end, the file name extension, its system 
signature and its properties are analysed using known 
techniques taking into account the structure of the files so 
and the meaning of each of its fields. 
[0126] In the first case, represented in box 503, test 
505 determines whether or not the document consid- 
ered comes from the wordprocessor M Word H . When the 
result of test 505 is positive, the document has a char- 55 
acteristic of being principally composed of text, the 
characteristic depicted in box 506. 
[0127] When the result of test 505 is negative, test 507 
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determines whether or not the document under consid- 
eration comes from the "Powerpoint" presentation soft- 
ware. When the result of test 507 is positive, the 
document has the characteristic of being principally 
composed of graphics, the characteristic depicted in 
box 508. 

[01 28] When the result of test 507 is negative, test 509 
determines whether or not the document under consid- 
eration comes from the "Excel" spreadsheet. When the 
result of test 509 is positive, the document has the char- 
acteristic of principally composed of text and graphics, 
the characteristic being depicted in box 510. 
[01 29] Where test 501 determines that the document 
under consideration comes from the "Adobe" suite, the 
case depicted in box 502, test 512 determines whether 
or not the document under consideration comes from 
the "Framemaker" desktop publishing software. When 
the result of test 512 is positive, the document has the 
characteristic of being principally composed of text, the 
characteristic depicted in box 513. 
[01 30] When the result of test 51 2 is negative, test 51 4 
determines whether or not the document under consid- 
eration comes from the "Photoshop" image processing 
software. When the result of test 514 is positive, the 
document has the characteristic of being principally 
composed to graphics, the characteristic depicted in 
box 515. 

[01 31 ] When the result of test 51 4 is negative, the test 
516 determines whether or not the document under 
consideration comes from the "Acrobat" wordprocess- 
ing software. When the result of test 516 is positive, the 
document has the characteristic of being principally 
composed of texts and graphics, a characteristic 
depicted in box 517. It should be noted here that, during 
operation 308, the characteristic determined is stored in 
the list of characteristics 309. 

[0132] Finally, when test 501 determines that the doc- 
ument comes from another software package, the case 
depicted in box 504, operation 302 (Figure 3) is per- 
formed directly. Following a positive result to one of 
tests 505, 507, 509, 512, 514 or 516 or following a neg- 
ative result to one of the tests 509 or 516, operation 302 
is performed. 

[01 33] Figure 6 details the operation 1 09 disclosed in 
Figure 1 . A pilot configuration being considered, opera- 
tion 601 consists of seeking in memory the structure 
corresponding to the register of the printer pilot under 
consideration. Next, test 602 consists of determining 
whether or not the printer pilot is up to date. To this end, 
the current version is compared with the last version on 
the pilot database, according to known techniques. 
[0134] When the result of test 602 is negative, an 
operation 605 of updating the pilot is performed, replac- 
ing the pilot already installed with its new version. When 
the result of test 602 is positive or after the operation 
605, operation 603 consists of recording, in the pilot 
registers, the parameters of the required configuration, 
and the references of the new version of the pilot. The 
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pilot is then configured (box 604). 

[0135] Figure 7 details the operation 101 illustrated in 

Figures 1 and 6. 

[0136] Operation 701 consists of determining the ver- 
sion of the current pilot, according to known techniques. 
[0137] Operation 702 next consists of interrogating 
the pilot database, situated at a distance, in order to 
obtain therefrom, in return, the date of the most recent 
pilot version (operation 703). 

[0138] Operation 704 consists, when the date of the 
most recent version is different from the date of the cur- 
rent version, of gaining access to the pilot server. Oper- 
ation 102 (Figure 1) then consists of downloading the 
required pilot and installing the pilot, according to known 
techniques which are not detailed here. 
[0139] Figure 8 depicts the main components of an 
electronic device 801 implementing the present inven- 
tion. This device 801 has an architecture which is 
known in the field of programmable electronic systems, 
based on the use of components connected together by 
a bus 802 and a central unit 800 controlled by a pro- 
gram, a simplified algorithm of which was presented 
with regard to Figures 1 to 7. 

[0140] For example, these elements are conjointly 
associated in a personal computer 100, of a known 
type, for example of the type functioning with a PEN- 
TIUM microprocessor 800 from the American company 
INTEL which has at least a random access memory, a 
read only memory, a screen 806, a mouse 810 and a 
keyboard 81 1 . 

[0141] An input/output port 803 receives the binary 
information coming from the user, by means of a key- 
board 81 1, a mouse 810, a touch screen (not shown) or 
any other communication means, and transmits them, 
under the control of the central unit 800, to a random 
access memory RAM 804. In addition, the input/output 
port 803 transmits, under the control of the central unit 
800: 

- to one of the printers 807 and 809, the data frames 
intended to control the printer selected in order to 
print the document 104, and 
to the screen 806, the data intended to be dis- 
played. 

[0142] The random access memory RAM 804, of a 
known type, contains registers intended to receive 
parameters, variables, digital data and intermediate 
processing values, as well as the databases mentioned 
above. 

[0143] A read only memory ROM 805, of a known 
type, stores the program which enables the device to 
operate and, in particular, for the central unit 800. The 
central unit 800, of a known type, for example consisting 
of a microcontroller, controls the functioning of the main 
components of the information transmission device. 
[0144] In the second embodiment described and 
depicted with regards to figs 9 to 1 1B, the file process- 



ing consists of an application for printing data repre- 
sented by this file. The invention also applies, however, 
to any data storage processing, data transmission 
processing or data display processing, with or without 

5 data compression. 

[0145] Figure 9 depicts the main components of an 
electronic device 901 implementing the present inven- 
tion. This device 901 has an architecture which is 
known in the field of programmable electronic systems, 

10 based on the use of components connected together by 
a bus 902 and of a central unit 900 controlled by a pro- 
gram, a simplified algorithm of which is presented in 
connection with Figure 10. 

[0146] For example, these elements are conjointly 
15 associated in a personal computer, of a known type, for 
example of a type operating with a PENTIUM 900 (reg- 
istered trade mark) microprocessor from the American 
company INTEL (registered trade mark), which contains 
at least one random access memory 904, a non-volatile 
20 memory 905, a screen 906, a mouse 910 and a key- 
board 911. 

[0147] An input/output port 903 receives the digital 
information coming from the user, by means of the key- 
board 91 1 , the mouse 910, a tactile screen (not shown) 
25 or any other communication means, and transmits it, 
under the control of the central unit 900. to a random 
access memory RAM 904. In addition, the input/output 
port 903 transmits, under the control of the central unit 
900: 

30 

to one of the printers 907, which is a monochrome 
laser printer, or 909, which is an ink jet printer using 
different operating modes, some of which are 
adapted to monochrome printing and others to 
35 polychrome printing, the data frames intended to 
control the printer selected in order to print each 
page of the document to be printed, and 
to the screen 906, the data intended to be dis- 
played. 

40 

[0148] The random access memory RAM 904, of a 
known type, contains registers intended to receive 
parameters, variables, digital data and intermediate 
processing values, as well as the data bases mentioned 
45 above. 

[0149] The non-volatile memory 905, of a known type, 
stores the program which enables the device and, in 
particular, the central unit 900, to operate. The central 
unit 900, of a known type, for example consisting of a 

so microcontroller, controls the operation of the principal 
components of the information transmission device. 
[01 50] In Figure 10, operations and tests successively 
performed by the central unit 900 for implementing the 
present invention can be seen. 

55 [0151] During an operation 1001 . the central unit ini- 
tialises the device illustrated in Figure 9, by implement- 
ing, for example, the known procedures for starting up 
the WINDOWS 95 (registered trade mark) operating 
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system. 

[0152] During an operation 1002, the user enters an 
instruction for printing a file "file" stored in the non-vola- 
tile memory 905, using the mouse 910 or keyboard 91 1 . 
[0153] During an operation 1003, the central unit 900 
determines a first quantity related to the document 
under consideration: the number of pages represented 
by the file "fife". To this end, the central unit 900 
accesses a segment of the file "file" where this informa- 
tion is disposed, or, according to a variant, opens an 
application able to permit the printing of the said file, for- 
mats the said file and collects the number of pages 
determined by the said application. 
[0154] During an operation 1004, the central unit 900 
determines a second quantity related to the document 
under consideration: the size of the file "fife", in num- 
bers of bytes. To this end, the centra! unit 900 access a 
segment of the file "file" where this information is dis- 
posed, or, according to a variant, reads the file "file" and 
counts the number of bytes read. 
[0155] During an operation 1005, the central unit 
determines a first index of the content of the file "file" by 
comparing the ratio of the number of pages represented 
by the file "file" to the number of bytes in this file, on the 
one hand, with two predetermined thresholds on the 
other hand. 

[0156] This is because the inventor has discovered 
that the said ratio is low, and generally less than 0.01 for 
a file content including images, between 0.01 and 0.1 
for contents of a file including graphics and texts, and 
greater than 0.1 for contents of a file including only 
texts. 

[0157] The content thus determined is stored in the 
register "content" of the random access memory 904. 
[0158] During an operation 1006, the central unit 900 
re-zeros a counter "duration" of the random access 
memory 104, initiates the incrementation of the counter 
"duration" by the clock (not shown) of the computer sys- 
tem and starts the functioning of data compression soft- 
ware acting on the file "file" in its initial form. In the 
embodiment described and depicted, the compression 
algorithm is the algorithm "zip\ implemented in the well 
known software packages "winzip", and "GZIP" availa- 
ble in the main so-called "public domain" software data 
bases (software packages known to persons skilled in 
the art under their generic English term "freeware" or 
"shareware"). 

[0159] During an operation 1007, the central unit 900 
completes the compression of the file "file" and stops 
the incrementation of the file "duration". 
[0160] The operations 1006 and 1007 thus make it 
possible to determine two quantities related to the doc- 
ument under consideration: a degree of compression 
and a period necessary for compression. 
[0161] During an operation 1008. the central unit 900 
determines a second index of the content of the file 
"file" by comparing the degree of compression obtained 
using the data compression software on the file "fr/e" on 



the one hand, with two predetermined thresholds on the 
other hand. 

[0162] This is because the inventor has discovered 
that the said degree is low, and, for example with the 
s algorithm of the zip compression software, is generally 
below 0.20 for a file content including images, between 
0.20 and 0.45 for file contents including graphics and 
texts, and greater than 0.45 for file contents including 
only texts. 

10 [0163] The content thus determined is stored in the 
register "content" of the random access memory 904, 
without erasing the content index already stored. 
[0164] Then, during an operation 1009, the central 
unit 900 determines a third content index of the file "file" 

15 by comparing the time necessary for compression, a 
time obtained by using the data compression software 
on the file "file" on the one hand, with two predeter- 
mined thresholds on the other hand. 
[01 65] This is because the inventor discovered that 

20 the said time is relatively short, and generally less than 
1 0 seconds for a file content including images, between 
10 and 30 seconds for file contents including graphics 
and texts and greater than 30 seconds for file contents 
which include only texts. 

25 [0166] In order to determine the values given above, 
as well as with regard to the operation 1012, the inven- 
tors used the following computer configuration: 

a so-called "IBM PC compatible" computer contain- 
30 ing an INTEL PENTIUM processor, with a 100 meg- 
ahertz clock speed (IBM, INTEL PENTIUM are 
registered trade marks), and a 16 megabyte ran- 
dom access memory RAM, 
using the WINDOWS 95 operating system sold by 
35 MICROSOFT (WINDOWS, WINDOWS 95 AND 

MICROSOFT are registered trade marks), and 
document sets commonly referred to as "Bench- 
marks". 

40 [0167] The content thus determined is stored in the 
register "content" of the random access memory 904, 
without erasure of the content indices already stored. 
[0168] Then, during an operation 1010, the central 
unit 900 re-zeros a counter "duration" of the random 

45 access memory 904, initiates the incrementation of the 
counter "duration" by the clock (not shown) of the com- 
puter system and starts the operation of an integrated 
software package (word processing, graphics software) 
in order to open the file "file", that is to say to make the 

so content thereof accessible to the user. 

[0169] During an operation 101 1 , the central unit 900 
completes the opening of the file "file" and stops the 
incrementation of the file "duration". 
[0170] The operations 1010 and 1011 thus make it 

55 possible to determine a quantity related to the docu- 
ment under consideration: a time of opening of the doc- 
uments by a predetermined application. 
[0171] During an operation 1012, the central unit 900 
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determines a fourth content index for the file "file" by 
comparing the opening time obtained using the inte- 
grated software on the file "file" on the one hand, with 
two predetermined thresholds on the other hand. 
[0172] This is because the inventor discovered that 
the said time is small, and, for example with the inte- 
grated software known as "WORD" (registered trade 
mark) sold under the brand name "MICROSOFT", it is 
generally less than 30 seconds for a file content includ- 
ing images, between 30 seconds and 1 minute for file 
contents including graphics and texts and greater than 1 
minute for file contents including only texts. 
[0173] The content thus determined is stored in the 
register "content" of the random access memory 904, 
without erasure of the content indices already stored. 
[0174] During a test 1013, the central unit 900 deter- 
mines whether or not the content of the file "file" can be 
determined, seeking whether at least three of the four 
content indices represent the same content When the 
result of the test 1013 is positive, during an operation 
1014, the content of the file is deemed to be identical to 
the identical value for at least three of the content indus- 
tries. 

[0175] Then the choice of the printer which is most 
suited to the printing of the assumed content of the file 
is made, operation 1015, and the configuration of the 
pilot of this printer is carried out. operation 1016, taking 
account of the content of the file "file". To this end, when 
each printer is installed, a question is posed to the user 
for the latter to indicate, according to known procedures 
for dialogue with the device, which printer and which 
printer configuration is the most suited to each docu- 
ment content. 

[0176] Next, the document is printed by the said 
printer in the said configuration, and then the central 
unit returns to the operation 1 001 . 
[0177] When the result of the test 1013 is negative, 
during the first iteration of an operation 1018, the central 
unit 900 determines parts of a document, these parts 
here being pages, and takes into consideration the first 
part of this document. During each following iteration of 
the operation 1018, it is the page which is the one which 
was previously considered which is, in its turn, taken 
into consideration. 

[0178] Next, during the test 1019, the central unit 900 
determines whether or not the last page has been con- 
sidered. When the result of the test 1019 is positive, the 
operation 1001 is reiterated. When the result of the test 
1019 is negative, during an operation 1020. for the page 
under consideration of the document represented by 
the file "file", the possible contents are reinitialised, and, 
during the single operation 1020. operations identical to 
the operation 1004 to 1012 are performed on the page 
under consideration. 

[0179] During a test 1021, the central unit 900 deter- 
mines whether or not the content of the page under con- 
sideration can be determined, seeking whether or not at 
least three of the four content indices of the said page 



have the same value. 

[0180] When the result of the test 1021 is positive, 
during an operation 1022, the content of the file is 
deemed to be identical to the identical value of these 
5 three indices. Then, during an operation 1023, the cen- 
tral unit 900 chooses the printer which is best suited to 
the printing of the assumed content of the page under 
consideration. 

[0181 ] Next, the central unit 900 configures the pilot of 

w this printer which is most suited to the printing of the 
page under consideration, operation 1024, and then the 
central unit initiates the printing of the page under con- 
sideration with the said printer and the said configura- 
tion, operation 1025, and returns to the operation 1019. 

15 [0182] When the result of the test 1021 is negative, for 
the page under consideration, the printer is chosen by 
default, operation 1026, the configuration is chosen by 
default, operation 1027, and the page under considera- 
tion is printed, operation 1028. Then the operation 1019 

20 is reiterated. 

[0183] It will be observed here that the operations 
1003. 1004, 1006. 1007. 1010. 1011 and 1020 may 
include a scrutinising of all the data of the document 
which are directly accessible to the user, without having 

25 to use a software application able to allow the user to 
modify neither the data nor the content of the said doc- 
ument. 

[0184] According to an embodiment of the invention 
which is not shown, the processing performed on the 

30 data of the document is a data compression, the com- 
pression algorithm being chosen as a function of the 
content of the document. It is known, in fact, that some 
document compression algorithms (for example with 
discreet Furrier transform) are more effective for images 

35 whilst other compression algorithms (for example of the 
"zip" type disclosed above) are more effective for texts 
and drawings. 

[0185] Figure 1 1 A depicts a flow diagram for deter- 
mining thresholds liable to be used in the flow diagram 

40 illustrated in Figure 1 0, this flow diagram being intended 
to supply thresholds and other data stored in the non- 
volatile memory of the device illustrated in Figure 9. 
[0186] After an operation 1 101 of initialising a device 
as illustrated in Figure 9. during an operation 1 102. the 

45 following document, in a predetermined list of docu- 
ments, is considered, knowing that, at the first iteration 
of the operation 1 102, it is the first document in this list 
which is considered. 

[0187] During an index determination operation 1 103, 
so the four content indices are determined, as disclosed 
above. 

[0188] During an operation 1 104, the central unit 900 
causes the display, on the screen 906, of the question: 
"Is the content of this document mainly text, 
55 drawing or image?", 

then stores the response given, by the user, by 
means of the keyboard 91 1 or mouse 910, in a known 
fashion, associated with the value of the four content 
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indices. 

[0189] During a test 11 05, the central unit 900 deter- 
mines whether or not the last document in the list has 
been considered. 

[0190] When the result of the test 1105 is negative, 
the operation 1102 is reiterated. When the result of the 
test 1105 is positive, during an operation 1106, in suc- 
cession: 

for all the documents whose content, given during 
the operation 1104, is of the "text"' type, for each 
index, the means of the values of the said index are 
determined and given a weighting coefficient equal 
to the number of documents in question, 
for all the documents whose content, given during 
the operation 1104, is of the "drawing" type, for 
each index, the means of the values of the said 
index are determined and given a weighting coeffi- 
cient equal to the number of documents in question, 
and 

for all the documents whose contents, given during 
the operation 1 1 04, is of the "image" type, for each 
index, the means of the values of the said index are 
determined and given a weighting coefficient equal 
to the number of documents in question. 

[0191] During an operation 1107, for each index, the 
means are put in their ascending order and the first 
threshold is determined at the median value between 
the two lowest mean values whereas a second thresh- 
old is determined as being equal to the median value 
between the two highest mean values. 
[0192] For example, if, for the first index, the following 
documents have been given a value given in the follow- 
ing table: 



Document content 


First index value 


text 


0.1 


text 


0.2 


text 


0.3 


drawing 


0.05 


drawing 


0.03 


drawing 


0.07 


image 


0.005 


image 


0.003 


image 


0.001 



whereas the means of the values of the first index, and 
their weighting, for each type of content, are: 



content 


mean 


weighting 


text 


0.2 


3 


drawing 


0.05 


3 


image 


0.003 


3 



10 

and the thresholds are equal to: 

(3 x 0.2 + 3 x 0.05 )/6 = 0.125 (threshold which sep- 
arates the text from the drawings) and 
is (3 x 0.05 + 3 x 0.003)/6 = 0.0265 (threshold which 
separates the drawings from the images). 

[0193] During an operation 1108, the values of the 
means of the indices, weighting coefficients and thresh- 
20 olds are stored. 

[0194] Then the operation of the device is interrupted, 
operation 1 1 09. 

[0195] Figure 1 1 B depicts a threshold correction flow 
diagram, taking into account the content defects indi- 

25 cated by the user of the device illustrated in Figure 9. 
[0196] After an operating sequence illustrated in Fig- 
ures 10A and 10B, operation 1 1 10, when the user uses 
the keyboard for indicating that a content error has been 
made, using the keyboard 91 1 or mouse 910, during an 

30 operation 1 1 1 1 , the central unit 900 causes the display, 
on the screen 906 of the question: 

7s the content of this document mainly text, draw- 
ing or image?", 

35 

then stores the response given, by the user, by 
means of the keyboard 91 1 or mouse 910 in a known 
fashion, associated with the value of the four content 
indices. 

40 [0197] Next, during an operation 1112, new means, 
new weighting coefficients and new thresholds are 
determined and stored, as disclosed above, with regard 
to operations 1 1 06 to 1 1 08. 

[0198] It will be understood that the correction mode 
45 illustrated in Figure 1 1 B, which takes into account the 
errors committed by the device according to the inven- 
tion when it implements the method of the invention with 
the thresholds which have been determined, at the 
manufacturer of the patented products, as illustrated in 
bo Figure 1 1 A, makes it possible to improve the determina- 
tion of the content of the document. 
[01 99] The scope of the invention is not limited to the 
embodiments described and depicted but quite the con- 
trary extends to any modifications and improvements 
55 within the capability of persons skilled in the art. 
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Claims 

1. Device (801) for determining conditions for 
processing liable to be carried out on data in a file 
(104), by at least one input/output means (111, 807, s 
809) which modulates a physical quantity, charac- 
terised in that it has: 

a means (800, 804, 805) of determining the 
semantics of the processing of the said data, 10 
and 

a configuration determination means (800. 
804, 805) adapted, without modifying the said 
data, to take into account the semantics of the 
said processing of the said data in order to 15 
determine the configuration of the pilot of the 
input/output means designed to implement this 
processing. 

2. Device according to Claim 1 , characterised in that: 20 

the processing is able to be carried out by at 
least two input/output means (807, 809), and 
the pilot configuration used by the determina- 
tion means (800. 804, 805) includes the selec- 25 
tion of the input/output means intended to 
implement the said processing. 

3. Device according to either one of claims 1 or 2, 
characterised in that: 30 

- it has a means (800, 804, 805, 806, 810, 811) 
of dialoguing with the user adapted to transmit 
questions to the user and to receive informa- 
tion from him in response, and 35 
the configuration determination means (800, 
804, 805) is also adapted to take into account 
the information received in response from the 
user in order to determine the pilot configura- 
tion. 40 

4. Device according to Claim 3, characterised in that it 
has a memory (804) adapted to store information 
received in response from the user. 

45 

5. Device according to Claim 4, characterised in that 
the said memory (804) is adapted also to store, 
associated with each item of information received, 
an item of information representing the user who 
supplied it. 50 

6. Device according to either one of Claims 4 or 5, 
characterised in that the said memory (804) is 
adapted also to store, associated with each item of 
information received, an item of information repre- 55 
senting the document concerned. 

7. Device according to any one of Claims 1 to 6, char- 



acterised in that it has a pilot updating means (800, 
804, 805) adapted: 

on the one hand to detect that a pilot of the 
input/output means intended to implement the 
said processing is not available or is not up to 
date in a memory (804), and 
on the other hand, to read the said pilot in 
another memory. 

8. Device according to any one of Claims 1 to 7, char- 
acterised in that: 

- it has a means (800, 804, 805) of checking the 
availability of input/output means adapted to 
transmit an item of information representing 
unavailability for the processing of the said 
data, when the means (807) intended to proc- 
ess the said data is not available for this pur- 
pose; and 

the determination means (800, 804, 805) is 
adapted to take into account the unavailability 
information in order to determine the configura- 
tion of another input/output means (809) able 
to implement the said processing of the said 
data. 

9. Device according to anyone of claims 1 to 8 for 
determining the conditions of a processing liable to 
be carried out on data of a document, by at least 
one input/output means (907, 909) which uses a 
physical quantity, characterised in that it has: 

a means of determining at least two quantities 
related to the said document (900, 904, 905), 
a means of estimating the content of this docu- 
ment (900, 904, 905), adapted to take into 
account each said quantity, and 
a configuration determination means (900, 
904, 905) adapted to take into account the con- 
tent of the document in order to determine the 
configuration of the pilot of the input/output 
means intended to implement this processing. 

10. Device according to Claim 9, characterised in that 
the quantity determination means is adapted to 
scrutinise data of the said document which are 
directly accessible to a user. 

11. Device according to either one of Claims 9 or 10, 
characterised in that the quantity determination 
means is adapted to function without using a soft- 
ware application able to make it possible to modify 
the content of the said document. 

12. Device according to any one of Claims 9 to 11. 
characterised in that the quantity determination 
means is adapted to read at least one quantity in a 
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file. 

13. Device according to any one of Claims 9 to 12, 
characterised in that: 

5 

the processing can be carried out by at least 
two input/output means (907, 909) and 
the pilot configuration determination means is 
adapted to select an input/output means 
designed to implement the said process. w 

14. Device according to any one of Claims 9 to 13, 
characterised in that the quantity determination 
means is adapted so that one of the said quantities 
represents the number of pages in the document is 
represented by the said document. 

15. Device according to any one of Claims 9 to 14, 
characterised in that the quantity determination 
means is adapted so that one of the said quantities 20 
represents the number of digital information items 

in the said document. 

16. Device according to any one of Claims 9 to 15, 
characterised in that the quantity determination 25 
means is adapted so that one of the said quantities 
represents a time taken to open the said document 

by software able to modify the content thereof. 

17. Device according to any one of Claims 9 to 16, 30 
characterised in that the quantity' determination 
means is adapted so that one of the said quantities 
represents a degree of compression able to be 
obtained on the said document using predeter- 
mined compression software. 35 

18. Device according to any one of Claims 9 to 17, 
characterised in that quantity determination means 
is adapted so that one of the said quantities repre- 
sents a period of time necessary for the compres- 40 
sion of the said document, by predetermined 
compression software. 

19. Device according to any one of Claims 9 to 18, 
characterised in that it includes a means of deter- 45 
mining parts (900, 904, 905) of the said document, 
and in that: 



tion of the pilot of the input/output means 
intended to implement this processing on the 
said part. 

20. Printer, characterised in that it has a device accord- 
ing to any one of Claims 1 to 1 9. 

21. Facsimile machine, characterised in that it has a 
device according to any one of Claims 1 to 1 9. 

22. Modulator-demodulator, characterised in that it has 
a device according to any one of Claims 1 to 19. 

23. Display screen, characterised in that it has a device 
according to any one of Claims 1 to 19. 

24. Photographic apparatus, characterised in that it has 
a device according to any one of Claims 1 to 19. 

25. Camera having an image sensor, characterised in 
that it includes a device according to any one of 
Claims 1 to 19. 

26. Method for determining conditions for processing 
liable to be carried out on data in a file (104), by at 
least one input/output means (111, 807. 809) which 
modulates a physical quantity, characterised in that 
it has: 

a step (105, 106) of determining the semantics 
of the processing of the said data, and 
a configuration determination step (107) during 
which, without modifying the said data (104), 
the semantics of the said processing of the said 
data is taken into account in order to determine 
the configuration of the pilot of the input/output 
means designed to implement this processing. 

27. Method according to Claim 26, characterised in 
that: 

the processing is able to be carried out by at 
least two input/output means (807, 809), and 
the pilot configuration determination operation 
(107) includes a step of selection of the 
input/output means intended to implement the 
said processing. 

28. Method according to either one of Claims 26 or 27, 
characterised in that: 

it includes an operation (1 05) of dialoguing with 
the user during which questions are transmit- 
ted to the user and information is received from 
him in response, and 

the configuration determination operation (107) 
also includes taking into account the informa- 
tion received in response from the user in order 



the determination means is adapted to deter- 
mine at least two quantities relating to each of so 
the said parts, 

the content estimation means is adapted to 
estimate the content of each of the said parts, 
taking into account each said quantity relating 
to the said part, and 55 
the configuration determination means is 
adapted to take into account the content of the 
said part in order to determine the corrfigura- 
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to determine the pilot configuration. 

29. Method according to Claim 28. characterised in that 
it includes a step (105. 207) of storing information 
received in response from the user. 

30. Method according to Claim 29, characterised in that 
the said storage step (207) also includes associat- 
ing, with each item of information received, and 
item of information representing the user who sup- 
plied it. 

31 . Method according to either one of Claims 29 or 30, 
characterised in that the said storage step (207) 
also includes associating, with each item of infor- 
mation received, an item of information represent- 
ing the document concerned. 

32. Method according to any one of Claims 26 to 31, 
characterised in that it includes a pilot updating step 
(101 , 102, 602. 605) during which. 

- on the one hand it is detected that a pilot of the 
input/output means intended to implement the 
said processing is not available or is not up to 
date in a memory, and 

- on the other hand, the said pilot is read in 
another memory. 

33. Method according to any one of Claims 26 to 32, 
characterised in that: 

it includes an operation of checking the availa- 
bility of input/output means (807, 809) during 
which an item of information representing una- 
vailability for the processing of the said data is 
transmitted, when the means (807) intended to 
process the said data is not available for this 
purpose; and 

during the determination operation (107), the 
unavailability* information is taken into account 
in order to determine the configuration of 
another input/output means (809) able to 
implement the said processing of the said data. 

34. Method according to anyone of claims 26 to 33. for 
determining the conditions of a processing liable to 
be carried out on data of a document, by at least 
one input/output means (907, 909) which uses a 
physical quantity, characterised in that it includes: 

an operation of determining at least two quanti- 
ties related to the said document (1003. 1004, 
1006. 1007, 1010. 1011. 1020) 
an operation of estimating the content of this 
document, taking into account each said quan- 
tity (1005, 1008, 1009, 1012. 1013. 1014, 
1020, 1022), and 



a configuration determination operation (1015, 
1016, 1023, 1024) during which the content of 
the document is taken into account in order to 
determine the configuration of the pilot of the 
input/output means intended to implement this 
processing. 

35. Method according to Claim 34, characterised in that 
the quantity' determination operation includes an 

o operation of scrutinising (1003, 1004, 1006, 1007, 
1010, 1011, 1020) the data of the said document 
which are directly accessible to a user. 

36. Method according to either one of Claims 34 or 35. 
(5 characterised in that the quantity determination 

operation (1003, 1004, 1006, 1007, 1020) can be 
carried out without having to use a software appli- 
cation able to make it possible to modify the content 
of the said document. 

20 

37. Method according to any one of Claims 34 to 36, 
characterised in that the quantity' determination 
operation includes an operation (1003, 1004, 1007) 
of reading at least one quantity in a file. 

25 

38. Method according to any one of Claims 34 to 37, 
characterised in that: 

the processing is able to be carried out by at 
30 least two input/output means (907, 909) and 

the pilot configuration determination operation 
includes an operation (1015, 1023) of selecting 
the input/output means intended to implement 
the said processing. 

35 

39. Method according to any one of Claims 34 to 38, 
characterised in that, during the quantity determi- 
nation operation (1003), one of the said quantities 
represents the number of pages in the document 

40 represented by the said document. 

40. Method according to any one of Claims 34 to 39, 
characterised in that, during the quantity determi- 
nation operation (1004), one of the said quantities 

45 represents the number of digital information items 
in the said document. 

41. Method according to any one of Claims 34 to 40, 
characterised in that during the quantity determina- 

so tion operation (1010, 101 1) one of the said quanti- 
ties represents a time taken to open the said 
document by software able to modify the content 
thereof. 

55 42. Method according to any one of the Claims 34 to 
41 , characterised in that during the quantity deter- 
mination operation (1006. 1007) one of the said 
quantities represents a degree of compression able 
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to be obtained on the said document by using pre- 
determined compression software. 

43. Method according to any one of the Claims 34 to 
42, characterised in that during the quantity deter- s 
mination operation (1006, 1007) one of the said 
quantities represents a time necessary for the com- 
pression of the said document, by predetermined 
compression software. 

10 

44. Method according to any one of Claims 34 to 43, 
characterised in that it includes an operation (1017, 
1025) of printing the said document. 

45. Method according to any one of Claims 34 to 43, is 
characterised in that it includes an operation of 
compressing the data representing the said docu- 
ment. 

46. Method according to any one of Claims 34 to 45, 20 
characterised in that it includes an operation of 
determining parts of the document (1018) and, for 
each of the said parts: 

an operation of determining at least two quanti- 25 

ties related to the said part (1020). 

an operation of estimating the content of the 

said part (1021, 1022), taking into account 

each quantity related to the said part, and 

a configuration determination operation (1023, 30 

1024) during which the content of the said part 

is taken into account in order to determine the 

configuration of the pilot of the input/output 

means intended to implement this processing 

on the said part. 35 

47. Method according to Claim 46, characterised in 
that, during the part determination operation 
(1018), pages of the document are determined. 

40 
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